Probabilistic D-Clustering

نویسندگان

  • Adi Ben-Israel
  • Cem Iyigun
چکیده

We present a new iterative method for probabilistic clustering of data. Given clusters, their centers and the distances of data points from these centers, the probability of cluster membership at any point is assumed inversely proportional to the distance from (the center of) the cluster in question. This assumption is our working principle. The method is a generalization, to several centers, of the Weiszfeld method for solving the Fermat–Weber location problem. At each iteration, the distances (Euclidean, Mahalanobis, etc.) from the cluster centers are computed for all data points, and the centers are updated as convex combinations of these points, with weights determined by the above principle. Computations stop when the centers stop moving. Progress is monitored by the joint distance function a measure of distance from all cluster centers, that evolves during the iterations, and captures the data in its low contours. The method is simple, fast (requiring a small number of cheap iterations) and insensitive to outliers.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Probabilistic Clustering Algorithms to Determine Mineralization Areas in Regional-Scale Exploration Studies

In this work, we aim to identify the mineralization areas for the next exploration phases. Thus, the probabilistic clustering algorithms due to the use of appropriate measures, the possibility of working with datasets with missing values, and the lack of trapping in local optimal are used to determine the multi-element geochemical anomalies. Four probabilistic clustering algorithms, namely PHC,...

متن کامل

Probabilistic Distance Clustering: A Spatial Clustering Approach for Remote Sensing Imagery

Spatial clustering has been widely applied in various applications, especially in remote sensing technology. Clustering the geographical nature of the remote sensing imagery is challenging due to its wide and dense spatial distribution. Renowned clustering algorithms such as k-means and other probabilistic clustering algorithms have been reported in the literature. However, they are not robust ...

متن کامل

Probabilistic Distance Clustering: A Spatial Clustering Approach for Remote Sensing Imagery

Spatial clustering has been widely applied in various applications, especially in remote sensing technology. Clustering the geographical nature of the remote sensing imagery is challenging due to its wide and dense spatial distribution. Renowned clustering algorithms such as k-means and other probabilistic clustering algorithms have been reported in the literature. However, they are not robust ...

متن کامل

Probabilistic Distance Clustering: A Spatial Clustering Approach for Remote Sensing Imagery

Spatial clustering has been widely applied in various applications, especially in remote sensing technology. Clustering the geographical nature of the remote sensing imagery is challenging due to its wide and dense spatial distribution. Renowned clustering algorithms such as k-means and other probabilistic clustering algorithms have been reported in the literature. However, they are not robust ...

متن کامل

Probabilistic Distance Clustering: A Spatial Clustering Approach for Remote Sensing Imagery

Spatial clustering has been widely applied in various applications, especially in remote sensing technology. Clustering the geographical nature of the remote sensing imagery is challenging due to its wide and dense spatial distribution. Renowned clustering algorithms such as k-means and other probabilistic clustering algorithms have been reported in the literature. However, they are not robust ...

متن کامل

Probabilistic Distance Clustering: A Spatial Clustering Approach for Remote Sensing Imagery

Spatial clustering has been widely applied in various applications, especially in remote sensing technology. Clustering the geographical nature of the remote sensing imagery is challenging due to its wide and dense spatial distribution. Renowned clustering algorithms such as k-means and other probabilistic clustering algorithms have been reported in the literature. However, they are not robust ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Classification

دوره 25  شماره 

صفحات  -

تاریخ انتشار 2008